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Two distinct distribution functions P sp (m) and P n 



of the scaled largest cluster sizes m are 



obtained at the percolation threshold by numerical simulations, depending on the condition whether 
the lattice is actually spanned or not. With R(p c ) the spanning probability, the total distribution of 
the largest cluster is given by Ptot(m) — R(p c )P sp (m) + (1 — R(p c ))P n3 (m). The three distributions 
apparently have similar forms in three and four dimensions while in two dimensions, Ptot(m) does 
not follow a familiar form. By studying the first and second cumulants of the distribution functions, 
the different behaviour of Ptotijn) in different dimensions may be quantified. 



Much has been investigated regarding the distribution 
of cluster sizes as far as percolating clusters are concerned 
jl]-[| . There have been some recent studies on the largest 
cluster size distribution below criticality and also distri- 
bution of smaller clusters at the percolation threshold 
The largest cluster size distributions, an example 
of extreme value statistics, is relevant for several physical 
phenomena like fracture and breakdown. One can obtain 
distribution functions for the spanning cluster as well as 
for smaller clusters though the form of the distribution, 
especially that of the percolating clusters is not simple. 
In general, it is a non-Gaussian function. 

Let p be the probability that a site is occupied in a lat- 
tice. At the percolation threshold p c , there may or may 
not exist a spanning cluster. The spanning probability 
depends on many factors like the kind of percolation (site 
or bond), type of lattice, boundary conditions etc. ||. 
Spanning will occur with a certain probability less than 
one, and in the study of spanning or percolating clusters 
at p c , only those cases where the lattice spans are taken 
into account. On the other hand, for the largest cluster 
size distribution, the calculations will include all the con- 
figurations whether the lattice spans or not. The largest 
cluster, in fact, enjoys a double role in the sense that 
it may or may not happen to be the spanning cluster. 
The probability of the smaller clusters being the span- 
ning cluster is relatively much smaller. Even when the 
lattice is spanned, the largest cluster may or may not be 
the spanning one. 

The event of the lattice being spanned or not cannot be 
predicted a-priori in random percolation. However, the 
consequence of the lattice being spanned or not would di- 
rectly be reflected by the nature of the distribution of the 
largest cluster which assumes different roles for the two 
cases. We find that the two distributions of the scaled 
mass or size m of the largest cluster, denoted by P sp (m) 
and P ns (to) for the spanning case and the non-spanning 
case respectively, are indeed different. The scaled size 
to = M I L D , where the mass of the largest cluster is M 
in a lattice of linear dimension L and D is the fractal 
dimension. The latter is related to the exponents of per- 
colation and is same for all clusters when they are ranked 



The total distribution, which can be independently 
computed, is actually the weighted sum of the two dis- 
tributions: 

P tat (m) = R( Pc )P sp (m) + (1 - R{ Pc ))P ns {m), (1) 

where R Pc is the spanning probability at p c . We in- 
vestigate the nature of the distributions for hypercubic 
lattices in two, three and four dimensions, where the 
Hoshen-Kopelman algorithm |l0| is used. Free boundary 
condition has been used in all dimensions, except that in 
two dimensions we have also used helical boundary con- 
ditions for comparison. In all cases, spanning has been 
considered from top to bottom. The largest lattice sizes 
considered are L — 1600, L = 98 and L = 27 in two, three 
and four dimensions respectively with typically 10 6 and 
10 5 configurations generated for the smallest and largest 
lattice sizes. 

We first compare the distribution of the largest and 
the spanning cluster sizes in percolating two-dimensional 
lattices (Fig. 1) and find that they are numerically indis- 
tinguishable almost always except for some cases where 
the size of the clusters are very small. One can ignore 
that difference and assume that at least for large values, 
the spanning cluster and the largest clusters are identi- 
cal. However, for consistency, we will consider the largest 
cluster in the spanning case strictly and not the spanning 
cluster if they are different. This also takes care of the 
fact that we will avoid the ambiguity arising due to the 
existence of more than one spanning cluster, which may 
happen in very few cases. 

Figures 2-4 show the three distributions for two to four 
dimensions with free boundary conditions. For clarity, 
we have shown the distribution for a single lattice size in 
three and four dimensions, which represents the scaling 
distribution. Certain features are clear from the figures: 
the total distribution in two dimensions is quite different 
in shape compared to those in three and four dimensions. 
The distribution when the lattice is not spanning, P ns , 
is much more sharply peaked in the higher dimensions 
while the width of P sp , the distribution for the spanning 
case, is of the same order in different dimensions. These 
features will be confirmed quantitatively later on. The 
forms of the two distributions P sp and P ns are clearly 
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different and it should be noted that it is not possible to 
get a collapse by any trivial scale transformation. 

The distribution function for the percolating clusters 
has been fit to the following forms 0-p|: a power law- 
exponential function 

f{x) = ax b exp(-cx d ), (2) 

or a double exponential function 

f(x) = aexp(— bx~ c ) exp(— dx e ). (3) 

What is important is the appearance of a number of 
parameters in both the functional forms and estimates 
from numerical results may turn messy and involve large 
errors. The focus of the present study is, however, not to 
obtain the precise form of the function but to compare 
the gross features of the distributions. Even without a 
detailed study, it is obvious that the total distribution 
P to t in two dimensions has a form very different from 
that of the individual distributions. For three or four 
dimensions, although the distributions for the spanning 
and non-spanning cases are distinct, the total distribu- 
tion does not carry any signature that it was generated 
from these two. 

That the behaviour of Ptot in two dimensions is differ- 
ent becomes all the more apparent when one attempts to 
fit the distributions to familiar forms. We do this without 
emphasis on the accuracy of the estimated parameters. 
In two dimensions as well as in three and four, the two 
distributions P sp and P ns fit quite well to the form (3) 
with different values of the parameters a,b,c,d,e. For 
example, in three dimensions c ~ 2.8 and e ~ 3.5 for 
P sp and c ~ 2.2 and e ~ 1.5 for P ns , The total distribu- 
tion, however, is of the same form only in three and four 
dimensions. The values of the exponents e.g., in three 
dimensions are c ~ 2.5 and e ~ 1.5 for Ptot- Although 
in general the parameters are different, we notice that 
among the exponents, the values of c are quite close for 
the three distributions and e is perhaps same for P tot and 
P ns . In four dimensions also the values of c are compara- 
ble for the three distributions: c lies between 1.9 and 2.0 
but the values of e are quite different. In two dimensions, 
however, both c and e are widely different for P ns and 
P sp : c ~ 2.5 and ~ 1.45, e ~ 3.0 and ~ 10.8 for the two 
respectively. 

Since the boundary condition plays an important role 
in percolation problems, we also evaluate the distribu- 
tions with a different boundary condition, namely, helical 
boundary conditions, in two dimensions. The results are 
shown in Fig. 5. Here also, there are two distinct distri- 
butions P sp and P ns - The cluster sizes will obviously be 
larger for helical boundary condition (HBC) compared 
to the free boundary case (FBC) . The shift in the proba- 
bility distributions can be explained by this but what is 
remarkable is that the total distribution shows a hump 
on the left side, showing that the form of the total distri- 
bution is again not conventional. The distribution, with 
the hump on the left side has a less pronounced form of 



the plateau-like region compared to the free boundary 
case. Such a hump has also been observed for small sizes 
using a renormalisation group scheme [O and periodic 
boundary conditions. Hence one can conclude that in 
general the total distribution for the largest cluster size 
is not in a familiar form, the effect being strongest in 
open boundary case. 

In order to understand the difference in the behaviour 
of the total distribution in different dimensions we note 
the following points. The reason for the characteristic 
structure of Ptot in two dimensions must be traced back 
to the features of the two independent distributions from 
which P t ot is generated. Ptot typically has a plateau like 
region (FBC) or a weak two peaked structure (HBC). 
This could be due to two reasons: either P sp and P n8 , 
which are both peaked, have negligible overlap; or the 
width of the distributions are comparable. As the distri- 
butions are normalised, this would imply the heights of 
the peaks are comparable. The value of R(p c ) should not 
play any part as it is not particularly different in different 
dimensions. 

The overlap between the distributions are not negligi- 
ble as one can observe from the figures. In order to com- 
pare the behaviour in two, three and four dimensions, we 
evaluate the ratios r\ — m s \ ) /rrins and r-j = m% :/mls 
where is the rth cumulant of the distributions. The 
subscripts on m denote the spanning and non-spanning 
events as usual. The first measure, if high, will indicate 
that the peaks of the distribution are far apart as r± is 
a measure of the mean scaled cluster size and lies close 
to the peak. The second ratio is roughly a ratio of the 
widths of the distributions which in turn is an estimate 
of the ratio of the height of the peaks. 

We notice very interesting behaviour of the two ratios 
defined above and shown in Fig. 6. The first ratio T\ 
varies between 1.6 and 1.7 with some weak dependence 
on dimensionality, on the other hand, shows strong 
dimensional dependence. The values of n and r-i in two 
dimensions with HBC and FBC indicate that both de- 
pend on boundary conditions. In two dimensions, is 
close to 1, indicating that the peaks of the distributions 
lie at comparable heights. In higher dimensions, T2 in- 
creases and differs significantly from unity. The latter 
may therefore be exclusively responsible for the different 
behaviour of the total distributions in different dimen- 
sions. 

The observation that there exist two separate distri- 
butions for the largest cluster sizes may not appear very 
surprising. However, it has not been noted in any previ- 
ous study although distribution functions for percolation 
is a much studied problem. Also, usually in percolation, 
quantities and their distributions have the same qualita- 
tive behaviour in dimensions below the upper critical di- 
mension. The spanning cluster distributions for example, 
in different dimensions, could be fit to the same form 
The smaller clusters had distribution functions which had 
familiar form, very small clusters following a Gaussian 
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distribution presumably in two dimensions [J7|. This is 
perhaps the first time a novel feature is seen to be present 
in two dimensions and absent in higher dimensions as 
far as the total distribution for largest clusters are con- 
cerned. Even if one argues that P sp and P ns should be 
considered as the more fundamental distributions, rough 
estimates of the exponents for the distributions for these 
two showed that in two dimensions, they are different by 
a much larger margin than in higher dimensions, indicat- 
ing that the two dimensional case is markedly different 
regarding distributions of largest cluster sizes. This is an 
effect independent of boundary conditions although the 
effect may vary for different cases. 

Since the largest and the spanning clusters coincide in 
most cases, P sp may be regarded as a previously known 
distribution. Hence P ns is the new distribution obtained 
from the study. We have made a brief comparison of 
P ns with the distribution of largest cluster sizes below 
p c , as in both cases the lattice does not span. The cu- 
mulative distribution Q C um( x ) of the distribution Q(n%i) 
of the largest cluster size mi below p c is well-known and 
has the form @-§ 

Qcum(x) = Q(mi < x) ~ exp(- exp(-Aiir + A 2 )) (4) 

such that ln(— ln(<5 cllm (cc)) is a straightline when plot- 
ted versus x. We recover this behaviour but find that 
for P ns , the behaviour of the corresponding cumulative 
distribution is much more complicated. Hence one can 
conclude that P ns is a completely independent distribu- 
tion and different from any previously known distribution 
in percolation. 

We make one last remark: the existence of two distinct 
distributions at p c is not a unique feature of the largest 



cluster only, it is true for any cluster which does not span 
the lattice. But the total distribution will behave differ- 
ently depending on the rank of the cluster. 
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FIG. 1. The comparison of the distributions for the largest cluster and the spanning cluster sizes are shown in two dimensions 
for L = 600 with open boundary conditions. 
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FIG. 2. The three distributions P sp , P ns and P to t for the scaled mass in two dimensions are shown for two different system 
sizes L — 400 (represented by x, A, and ★ respectively) and L — 1000. The best fit lines of the form (3) are also shown for 
Psp and Pns • 
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Fig3 
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FIG. 3. The three distributions P ap , P n3 and Ptot for the scaled mass in three dimensions are shown for L = 60 along with 
the best fit lines of the form (3). 




FIG. 4. The three distributions P sp , P ns and P to t for the scaled mass in four dimensions are shown for L — 21 alongwith the 
best fit lines of the form (3) . 
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Fig 5 Helical 




FIG. 5. The three distributions P ap , P ns and P to t for the scaled mass in two dimensions are shown for L — 600 with helical 
boundary conditions. 
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Fie 6a Ratio of cumulants in 2-d 
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Fig 6b Ratio of cumulants in 3-d 
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Fig 6c Ratio of cumulants in 4-d 
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FIG. 6. The ratios n and r2 defined in the text are shown in two, three and four dimensions (FBC and HBC indicate free 
and helical boundary conditions respectively). 
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